
Welcome to the replication package for “Evaluating the Minority Candidate Penalty with a Regression Discontinuity Approach.” This package contains the code and datasets needed to replicate the analyses, figures, and tables presented in the paper and SI.

Package Contents

This replication package consists of four components. 

1. “dataprep_replicationscript.R”: This script documents the data preparation steps used to construct the main analysis dataset. It reads large proprietary datasets that we do not have permission to post publicly in their entirety, and these are not included in this replication package. The script therefore cannot be run without separate access to those underlying datasets; we provide it so that others can evaluate how the main dataset was constructed. Its output, “fullRDDdataset20182020_analysis.csv”, is included in the replication package.

2. “figuresandtables_replication.R”: This script pulls in the main dataset and replicates all of the figures and tables featured in the main paper and SI. 

3. “fullRDDdataset20182020_analysis.csv”: This CSV file was produced by “dataprep_replicationscript.R” and is used by “figuresandtables_replication.R”. It is the foundation for all analyses, figures, and tables in the replication.

4. “2016_PreBySLDist.csv”: This dataset is needed only to replicate Figure A1 in the SI. The “figuresandtables_replication.R” script uses it solely for that purpose.
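
Before running the figures-and-tables script, it may help to confirm that the three files it depends on are in place. The following is a minimal sketch, not part of the package itself. It assumes that all files sit in the current working directory and that the script reads them by relative path, which this README does not state. The names `required_files` and `missing_files` are hypothetical helpers introduced only for illustration.

```r
# Files the README lists as needed to run the figures-and-tables script.
required_files <- c(
  "figuresandtables_replication.R",
  "fullRDDdataset20182020_analysis.csv",
  "2016_PreBySLDist.csv"
)

# Hypothetical helper: return the names of any required files
# that are not found in `dir`.
missing_files <- function(dir = ".") {
  present <- file.exists(file.path(dir, required_files))
  required_files[!present]
}

# Assumption: the package files are in the current working directory.
missing <- missing_files(".")
if (length(missing) > 0) {
  message("Missing files: ", paste(missing, collapse = ", "))
} else {
  message("All required files found.")
}
```

If nothing is reported missing, the replication can then be started with `source("figuresandtables_replication.R")`. If the script sets its own paths, adjust the directory passed to `missing_files()` accordingly.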


For any further questions, please contact Ariel White (arwhi@mit.edu). 
